#LLM Agent

Research

DCA-Bench: A Benchmark for Dataset Curation Agents
Benhao Huang, 
Yingzhuo Yu, 
Jin Huang, 
Xingjian Zhang, 
Jiaqi W. Ma
KDD-2025 DB Track (Oral)
#LLM Agent
#Benchmark
#2025

A benchmark exploring the performance of LLM Agents on detecting issues in datasets hosted on popular platforms.

paper
code